Arousal and valence prediction in spontaneous emotional speech: felt versus perceived emotion
نویسندگان
چکیده
In this paper, we describe emotion recognition experiments car ried out for spontaneous affective speech with the aim to com pare the added value of annotation of felt emotion versus an notation of perceived emotion. Using speech material avail able in the TNO-GAMING corpus (a corpus containing audio visual recordings of people playing videogames), speech-based affect recognizers were developed that can predict Arousal and Valence scalar values. Two types of recognizers were devel oped in parallel: one trained with felt emotion annotations (generated by the gamers themselves) and one trained with perceived/observed emotion annotations (generated by a group of observers). The experiments showed that, in speech, with the methods and features currently used, observed emotions are easier to predict than felt emotions. The results suggest that recognition performance strongly depends on how and by whom the emotion annotations are carried out.
منابع مشابه
Empirical Study of Dimensional and Categorical Emotion Descriptors in Emotional Speech Perception
The dynamic between speaker intent and listener perception is played out in the variation of acoustical cues by the speaker that must be interpreted by the listener to determine in an appropriate way. Emotion speech research must rely on either acted intent (i.e., an actor attempting to express an emotion) or listener perception (i.e., listening tests to assign emotional categories to non-acted...
متن کاملA Framework of Human Emotion Prediction Based on a Multi-Dimensional Emotion Model
In this paper, we propose a framework of emotion prediction based on a multi-dimensional emotion model using object similarity. The proposed framework predicts the user's level of emotional response to an emotional stimulus, based on a sensitivity database that consists of self-assessment manikin-based integer-scale data, rated on various object stimulations by many users. We experimented on 72...
متن کاملAn Investigation of Emotion Dynamics and Kalman Filtering for Speech-Based Emotion Prediction
Despite recent interest in continuous prediction of dimensional emotions, the dynamical aspect of emotions has received less attention in automated systems. This paper investigates how emotion change can be effectively incorporated to improve continuous prediction of arousal and valence from speech. Significant correlations were found between emotion ratings and their dynamics during investigat...
متن کاملHow emotional auditory stimuli modulate time perception.
Emotional and neutral sounds rated for valence and arousal were used to investigate the influence of emotions on timing in reproduction and verbal estimation tasks with durations from 2 s to 6 s. Results revealed an effect of emotion on temporal judgment, with emotional stimuli judged to be longer than neutral ones for a similar arousal level. Within scalar expectancy theory (J. Gibbon, R. Chur...
متن کاملSpeech-based recognition of self-reported and observed emotion in a dimensional space
The differences between self-reported and observed emotion have only marginally been investigated in the context of speech-based automatic emotion recognition. We address this issue by comparing self-reported emotion ratings to observed emotion ratings and look at how differences between these two types of ratings affect the development and performance of automatic emotion recognizers developed...
متن کامل